Fix classification report if dataset has no labels #3375

alanakbik · 2023-11-12T20:11:31Z

If the test dataset has no labels, and the model makes no predictions, no evaluation numbers (precision/recall/F1) can be computed. This causes errors in cases such as this:

import flair
from flair.data import Corpus
from flair.datasets import ColumnCorpus, WNUT_17
from flair.embeddings import TransformerWordEmbeddings
from flair.models import SpanClassifier
from flair.trainers import ModelTrainer

flair.set_seed(123)

corpus: Corpus = WNUT_17().downsample(0.01)

label_dictionary = corpus.make_label_dictionary("ner", add_unk=True)

embeddings = TransformerWordEmbeddings(
    model="distilbert-base-uncased", layers="-1", subtoken_pooling="first", fine_tune=True, use_context=True
)

tagger = SpanClassifier(embeddings=embeddings, label_dictionary=label_dictionary)

trainer = ModelTrainer(tagger, corpus)

trainer.fine_tune(
    "./results/",
    max_epochs=1,
)

This PR slightly refactors the evaluate() code in the DefaultClassifier such that the error handling for this case becomes more explicit. This also fixes the above error.

flair/nn/model.py

alanakbik added 3 commits November 12, 2023 21:03

Fix classification report for datasets without labels

4f060c2

Further simplify code

d9c7be6

Further simplify code

f21edf3

alanakbik changed the title ~~Fix classification report~~ Fix classification report if dataset has no labels Nov 12, 2023

alanakbik requested a review from helpmefindaname November 12, 2023 20:11

alanakbik added 2 commits November 12, 2023 21:54

Further simplify code

d292c2d

Fix mypy

73e1f5c

helpmefindaname requested changes Nov 13, 2023

View reviewed changes

flair/nn/model.py Outdated Show resolved Hide resolved

Fix outputs

6e00253

helpmefindaname approved these changes Dec 4, 2023

View reviewed changes

alanakbik merged commit 28c9a83 into master Dec 5, 2023
1 check passed

alanakbik deleted the fix_classification_report branch December 5, 2023 04:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix classification report if dataset has no labels #3375

Fix classification report if dataset has no labels #3375

alanakbik commented Nov 12, 2023

Fix classification report if dataset has no labels #3375

Fix classification report if dataset has no labels #3375

Conversation

alanakbik commented Nov 12, 2023